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(54) DATA STORAGeRETRIEVAL SYSTEM 

(57) Conventional information storage systems are 
subject to numerous practical constraints such as conti- 
guity in the physical locations of blocks and the require- 
ment that storage blocl^ be created in advance. 
Information retrieval in these systems has required the 
creation of indices, which take a long time to generate, 
and the structure of these systems makes them prone 
to deadlock since the indices are updated and the range 
of exclusion broadened when the referent information is 
modified. This invention utilizes the random access 
facilities of semiconductors to achieve high speeds and 
minimize the maintenance load. This Invention intro- 
Figure 4 



duces location tables and alternate-key tables to 
replace these indices. It also stores muKiple records in a 
single block and can handle variable-length records and 
spanned records. The location tables manage the stor- 
age blocks. An alternate-key block is made up of a sub- 
stitute key and its block number and the primary key 
value, either of which may be used to retrieve a target 
record by searching this table. Binary search is a well- 
known high-speed method of querying tables, but other 
methods may be used as well. 



Block 



Block 
No. 



o 

CO 

o 

CO 
CM 
O) 

o 

Q. 

lU 



Primary 



Froa To 



Overt low 



Fran To 



Record 1 



Prin 
key 



Alt 

key 



Record t 



Prs 
key 



All 
key 



Overflow 

block 

address 



Priniod by Xerox (UK) Business Servioes 
2.16.7/3.6 



1 



EP 0 923 030 A1 



2 



Description 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[0001 ] The invention relates to the field of information 
storage and retrieval, and particularly to the storage and 
reading of data with computers and also particularly to 
data management in that it provides data storage and 
retrieval at high speeds, high ratios of data to storage 
capacity and greatly reduced maintenance time. 

2. Description of Related Art 

[0002] Conventional information storage systems are 
subject to numerous practical constraints such as conti- 
guity In the physical location of blocks and the require- 
ment that storage blocks be created in advance. 
Moreover, in most systems random access requires the 
creation of indices, which take a long time to generate, 
and the structure of these systems makes them prone 
to deadlock since the indices are updated and the range 
of exclusion broadened when the referent information is 
modified. Although direct access systems do not use 
indices, these systems relate record keys with storage 
locations by means of special programs known as rand- 
omizing routines. In practice they do not allow for 
sequential access and exhibit low storage efficiency 
when compared with methods that use indices. 

SUMMARY OF THE INVENTION 

[0003] Since conventional information storage and 
retrieval systems presumed their data would be stored 
on magnetic disK in certain respects their shortcomings 
are unavoidable. On the other hand, the price of semi- 
conductors has fallen remarkably and conditions for 
using semiconductors as memory devices are falling 
into place. Semiconductors do not necessitate physical 
movement or rotation and ^enable high-speed storage 
and reading even if addresses are not contiguous. The 
construction of a memory device taking advantage of 
these properties and comprising semiconductors alone 
or semiconductors and random access memory media 
would enable high-speed storage and retrieval process- 
ing. A semiconductor may be used as the primary mem- 
ory device or may be provided to the invention as an 
external memory device. Storing information sequen- 
tially in blocks, using overflow blocks for storage when 
the insertion of information causes an overflow, using 
location tables or alternate-key blocks that manage stor- 
age blocks rather than using indices, and performing 
retrievals from these location tables or alternate-key 
blocks enables high-speed storage and reading, 
improves the efficiency of information storage and mini- 
mizes the occurrence of deadlock. 
[0004] Location tables and alternate-key tables are 



introduced in place of indices. Records are not stored 
independently; rather, multiple records are stored in a 
single block. FIG. 4 illustrates the structure of a block 
The FROM and TO segments in FIG. 4 represent the 

5 minimum key value and maximum key value of the 
block, respectively, but neither is absolutely necessary 
and the invention may be implemented with either one 
alone. When using long records, a single record may be 
matched to a single block, and a format in which a sin- 
to gle record is stored over two or more blocks (spanned 
records) is also possible. All primary blocks in a single 
file and all overflow blocks in a single file are of equal 
lengths, facilitating region management. Storage in 
blocks allows block detection by means of fixed-length 

15 operations even when variable-length records are 
stored because the block units are of a fixed length. The 
location table secures the required contiguous region in 
advance. A single record (entry) in a location table man- 
ages a single primary block in the data record storage 

20 region. Although it is also possible to implement meth- 
ods of managing multiple primary blocks, since prinnary 
blocks may be of any given size it is simpler not to man- 
age multiple blocks if the implementation allows modifi- 
cation of the size of blocks. Blocks are classified into 

25 primary blocks and overflow blocks, and only the pri- 
mary blocks are managed in the location table. Records 
are stored first of all in primary blocks. If storage in a 
block is rendered impossible due to insertion of a 
record, a single overflow block is allocated to that block. 

30 If that overflow block is not sufficient for storage, one 
more overflow block is allocated. Overflow blocks are 
managed as dependent blocks of their primary block; 
they are not managed in the location table but only by 
pointing from the primary block. 

35 [0005] Since overflow blocks are not managed in tiie 
location table, records are not inserted into tiie location 
table, tiie time it takes to re-write the location table is 
minimized and rewriting of the location table is a single 
record. Therefore, tiie range of any exclusion that may 

40 occur is radically minimized and tiie possibility of a 
deadlock is greatiy reduced. 
[0006] A deadlock occurs when two different tasks on 
a single computer perform exclusion operations on two 
or more identical resources in different sequences but 

45 does not occur if the sequence of exclusion is identical, 
and narrower ranges of exclusion reduce the probability 
of deadlock. Conventional indices are made up of multi- 
ple levels. When a record is accessed and a record's 
key is modified when an index is updated, tiie index is 

50 also modified. When this affects the lowest-level index 
alone, the exclusion range is restricted, but in some 
cases this may affect higher-level indices. Since exclu- 
sion affects a range of multiple records and it takes a 
great deal of time to update tiie indices in such cases. 

55 exclusion takes up long periods of time and is a cause 
of multiple instances of deadlock. 
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OBJECTS AND ADVANTAGES 

[0007] Accordingly, several objects and advantages of 
the invention are: 

(a) To provide high-speed random access by using 
primary keys and alternate keys; 

(b) To provide high-speed sequential read and write 
operations; 

(c) To provide minimal modification caused by 
updating, modification and deletion of data, short 
processing time, and high-speed storage by using 
location tables and alternate-key blocks instead of 
Indices; 

(d) To provide creation (generation) of location 
tables and alternate-key blocks in shorter times 
than are required for the creation of indices; 

(e) To eliminate the necessity of creating blocks in 
advance: 

(f) To provide the ability to add blocks until no phys- 
ical space remains on a storage medium; 

(g) To provide record-compression capabilities 
deriving from the capability of handling variable- 
length records and record insertion (though in most 
cases compressed records cannot be updated); 

(h) To provide the capability of using multiple alter- 
nate keys; and 

(i) To provide greatly reduced database mainte- 
nance time. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0008] 

FIG. 1 illustrates the structures of a location table 
and a final pointer, either or both of which may be 
Implemented with different lengths, the number of 
bytes given for each being for illustrative purposes 
only. 

FIG. 2 Illustrates the relationship between a loca- 
tion table and its blocks. 

FIG. 3 illustrates the structures of an alternate-key 
table (and its entries) and an alternate- key block, 
either or both of which may be implemented with 
different lengths, the number of bytes given for each 
being for illustrative purposes only, in which the 
FROM and TO segments hold the minimum and 
maximum key values, respectively, of the block and 
the alternate-key block may hold both FROM and 
TO values or it may hold only one of these. 

FIG. 4 illustrates the structure of a block. 

FIG. 5 illustrates the structure of a pre-alternate-key 
block. 



FIG. 6 illustrates the relationship between primary 
blocks and overflow blocks In which the blocks 
occupy a contiguous region in order to facilitate 
understanding, but there being no requirement that 
5 they do so in practice. 

FIG. 7 illustrates an e)Qmple of keys applied to text 
data. 

10 DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

[0009] Records stored with this system are restricted 
to records having a single unique primary key and none 

15 or one or more non-unique (though these may be 
unique) keys (alternate keys). The system does not 
accommodate records that do not have a unique key 
[0010] However, a serial number or otiier unique key 
may be forcibly applied to a record not having a unique 

20 key when such a record is added and reading per- 
formed only in the physical order of records or in the 
order of their alternate keys. Since insertion is not thus 
involved, an overflow block is required only when updat- 
ing a record increases the length of the record. 

25 [001 1 ] In the description following, "addition" refers to 
storage of a record having a primary key larger than the 
largest primary key of records currently stored, and 
"Insertion" refers to storage of a record having a primary 
key smaller than the largest primary key of records cur- 
se? rently stored. 

[001 2] Rrst, we will describe the storage system. The 
size of a location table is calculated from tiie number of 
records planned to be stored, the size of their blocks, 
and the number of primary-key blocks per record in the 

35 location table, and a contiguous region sufficient for this 
size is secured in a storage medium. Similarly, the size 
and number of alternate-key blocks is determined and a 
contiguous region sufficient for this size secured so as 
to enable storage of entries in the number of records 

40 also storing alternate-key blocks. However, in the event 
storage exceeds tiie number originally assumed, there 
is a possibility that tiie contiguous region may fill up, 
making further storage impossible. In sudi cases, an 
additional contiguous storage region is secured and an 

45 address-conversion table used to treat multiple contigu- 
ous regions as though they were one contiguous region, 
thus allowing tiie system to accommodate situations in 
which the number of records stored exceeds the 
number originally assumed. When tiiere are multiple 

50 alternate keys, a region is secured for each one. There 
is no need for the alternate-key blocks corresponding to 
different alternate keys to be contiguous. FIG. 2 illus- 
trates the relationship between a location table and its 
blocks. 

55 [0013] When using a storage system split into sub- 
ranges, a location table is created for each sub-range in 
a size suited to tiie number of records intended to be 
stored in the sub-range. Each location table must be in 
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a contiguous region, but all location tables need not be 
In a single contiguous region. An alternate-key block is 
set up not split across regions, but in a contiguous 
region of a size con-esponding to all records. 
[0014] When storing the first record, first a final pointer 
is referenced in exclusion mode. The final pointer con- 
trols how far the block and location table are used and 
so has the format Illustrated in FIG. 1 . Since there are 
no stored records to begin with, the first block is regis- 
tered in the final pointer as the final block. The block 
number and primary-key value are registered in the final 
pointer. Next, the first primary block is secured in exclu- 
sion mode, and its physical address and block number 
(0 in this case since numbering starts from 0) are regis- 
tered. When a block is secured on disk, its entry 
includes the value of its primary key Next, records are 
registered In that block. Then ail exclusion is released. 
[0015] To register the second record, first the final 
pointer is referenced in exclusion mode and it is deter- 
mined whether the primary key is greater than the value 
of the key in the final pointer. First we will describe how 
a record is added. Since the block number is 0. block 
number 0 in the location table is referenced in exclusion 
mode, the physical location of block 0 obtained and 
block 0 at that physical location read. If there is sufficient 
space in that block, the record is stored, the value of the 
primary key registered in the final pointer and all exclu- 
sion released. 

[0016] Subsequent additions are performed in like 
fashion. Sometimes the space in block 0 will not be suf- 
ficient to store an additional record. If block 0 lacks suf- 
ficient space when the above operation is performed 
and block 0 read on the addition of record number m, a 
single primary block (block number 1) is secured in 
exclusion mode. Record number m is then stored in 
block 1. The second record in the location table is then 
referenced in exclusion mode, and the physical location 
of primary block 1 registered in this record. Ail exclusion 
is then released. In this way additions are stored in the 
location that physically follows the last record. When 
split across sub-ranges, the same operations are per- 
formed for each sub-range. 

[0017] Next we describe how a record is inserted. 
Assume that multiple location-table records, primary 
blocks and data records already exist. 
[0018] It is first of all necessary to determine which 
block the record inserted should be stored in. 
[0019] This is done by searching the location table. A 
binary search is one example of a high-speed search 
method. Though a binary search method is used here 
as an example, other methods may also be used to find 
a target entry. The method used here is to find dichoto- 
mous points and compare the value of the primary key 
of the record stored in that block (abbreviated below as 
"stored primary-key value"), including the overflow block 
if the primary block that record points to has an overflow 
block, with the value of the primary key of the record 
inserted (abbreviated below as "inserted primary-key 



value"). If the inserted primary-key value is greater than 
the smallest of the stored primary-key values of the 
block or blocks and smaller than the smallest of the 
stored primary-key values in the next blocK it is stored 

5 in the first of those two blocks. Othenvise. the sizes of 
the stored primary-key values of that block and the 
insertion-block primary-key value are again compared, 
dichotomous points are obtained in the former if the 
inserted primary-key value is smaller and in the latter if 

10 the inserted primary-key value is not smaller, and like 
operations performed to identify the block to be used for 
storing the record. 

[0020] If the location table Is composed of multiple 
contiguous regions, a binary search cannot be per- 

15 formed as is. but using an address-conversion table to 
treat it as though it were a single contiguous region 
allows a binary search to be performed. 
[0021 ] If the location table is split across sub-ranges, 
first a comparison is made witii the value of either the 

20 initial or the final primary key of the location table in 
each sub-range to determine which sub-range holds tiie 
target record. A binary search Is then performed on that 
sub- range in like manner as described above to identify 
the block that holds tiie target record. 

25 [0022] If the record is to be stored in block number n, 
the following shall apply The location witiiin the block 
where the record is to be inserted is identified. Since 
records are arranged in the order of their primary keys, 
the location of the insertion will be that location immedi- 

30 ately prior to the record having a primary key larger tiian 
the record being inserted. First, a check is performed for 
duplication of the primary keys, ff tiiere is any duplica- 
tion of primary keys, an error is output since the storage 
cannot be performed. If there is no duplication of pri- 

35 mary keys, one or more records located behind the 
record being inserted are moved reanvards to create a 
space exactly equal to the size of the record being 
inserted. If tiie records thus moved fit within the block, 
insertion is completed, but if those records do not fit 

40 within the blocK a single overflow block is provided, a 
pointer to it provided from the primary block, and as 
many records stored in the overflow block as necessary 
Then the record to be inserted and any tiiat follow it are 
stored in the primary block. FIG. 6 illustrates tiie logical 

45 relationship between primary blocks and overflow 
blocks. 

[0023] If an overflow block already exists, it is to be 
preferred for it to be possible to store the record combin- 
ing a primary block and its overflow block. There is also 

50 a possibility that only a part of an overflow block Is used 
and the region not used efficientiy In order to avoid 
such, a single overflow block may be provided for multi- 
ple primary blocks. It is also possible to provide all over- 
flow blocks in an identical size smaller than that of their 

55 primary block. It is also possible use a single overflow 
block by pointing to it from multiple primary blocks. 
[0024] Another method that may be adopted is to 
store a record that overflows in an independent storage 
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region and provide a pointer to it from its primary block. 
However, if this method results in a large number of 
overflow records, it entails the disadvantage of long 
retrieval times compared with a method using overflow 
blocks. In practice, the storage method selected should 
be that suited to the number of records that are to be 
generated. 

[0O25] Next, we describe storage and updating with 
alternate keys. Alternate-key tables are stored in an 
alternate-key block in the order of their alternate keys. 
Entries in an alternate-key table consist of an alternate 
key. the physical address of the block where the record 
of that key value is stored, and the primary key of the 
record of that key value. The number of entries in an 
alternate-key table changes when its records are added 
or updated, but there is a high possibility that an 
increase in the number of entries will result in entry 
insertion and a very low possibility that such increase 
will result in entry addition. Therefore, the management 
methods used for primary keys are not appropriate. If 
there are already more stored records than the final 
number planned. Insertions can be processed efficiently 
by providing a space of defined size when storing an 
entry in an alternate-key block, but if the initial number 
of records is less than the final number of stored records 
planned, key insertions will result in a multiple overflow 
alternate-key blocks. A pre-alternate-key block is used 
in such cases. 

[0026] A pre-alternate-key block has the same struc- 
ture as an alternate-key block, and the number of pre- 
alternate-key blocks is the size that can accommodate 
entries in the number of alternate-key blocks divided by 
the size of the pre-alternate-key blocks. When the 
number of entries in a pre-alternate-key block becomes 
equal to the number of alternate-key blocks, these 
entries are moved from the pre-alternate-key block to an 
alternate-key block. When such entries are moved, in 
principle a single entry is stored in any given alternate- 
key blocK but entries having an identical alternate key 
are stored in the same alternate-key block. If the 
number of entries having an identical alternate key is 
too large to store them in the corresponding alternate- 
key block, they are stored in an added alternate-key 
overflow block. 

[0027] For example, assume that the final number of 
stored records planned is one million records. If 100 
entries can be stored in a single alternate-key block. 
10,000 alternate-key blocks will be required. The entries 
are stored in a pre-alternate-key block until there are 
10.000 of them, and when the number of entries 
reaches 10,000. the entries are transferred to an alter- 
nate-key block. 

[0028] If there is a large number of alternate-key 
blocks and a single level of pre-alternate-key blocks is 
provided, there is a possibility that this will result in a 
large number of pre-alternate-key blocks, a high fre- 
quency of insertions and inefficient updating. In such 
cases, multiple levels of pre-alternate-key blocks are 



provided. Taking the example described above, 10,000' 
entries will be stored in a pre-alternate-key block, but 
this pre-alternate-key block is provided two levels since 
100 entries can be stored in a single blocK 100 entries 

5 are managed in the first pre-alternate-key block and 
when the number of entries in that first pre-alternate-key 
block reaches 100. they are transferred to the second- 
level pre-alternate-key block. 
[0029] FIG. 5 illustrates an example of such a transfer 

10 in which a two-level pre-alternate-key block is provided. 
[0030] Next, we will describe retrieval of records using 
primary keys. This operation is performed in the same 
fashion used to determine the insertion location when a 
record is inserted. TTie example of retrieval described 

75 here uses the same binary-search method used in the 
example described for insertion. First, dichotomous 
points are found in the location table, and the value of 
the primary key of the record stored in that block (abbre- 
viated below as "stored primary-key value"), including 

20 the overflow block if the primary blodi that record points 
to has an overflow blocK is compared with the value of 
the primary key of the target record (abbreviated below 
as "target primary-key value"). If the target primary-key 
value is greater than the smallest of the stored primary- 
's key values- and smaller than the smallest of the stored 
primary-key values in the next blodc, either the target 
record exists in that block or the record of that key value 
does not exist in the file. Since records in a block are 
arranged in the order of their primary keys, searching 

30 the block can detect the target record or confirm 
whether the record does not exist in the file. Othenvise, 
the sizes of the stored primary-key values of that block 
and the primary-key value of the target record are again 
compared, dichotomous points are obtained in the 

35 former if the target primary-key value Is smaller and in 
the latter if the target primary-key value is not smaller, 
and like operations performed to identify the block stor- 
ing the record. 

[0031 ] Next, we will describe retrieval using alternate 
40 keys. Retrieval with alternate keys is performed by 
searching alternate-key blocks. The binary search 
method is typically used, but the discussion is omitted 
here since it is described above with respect to retrieval 
using primary keys. The alternate-key block including 
45 the target alternate key is identified. Then the target 
alternate-key table in the alternate-key block is identi- 
fied. The result is that, as when using primary keys, 
either the entry exists in that block or a record with that 
aiternate-key value does not exist in the file and the 
50 entry does not exist. If the alternate-key block has any 
alternate-key overflow blocks, all such blocks are 
searched. 

[0032] If the alternate-key table (or entry) having the 
target key value is identified, the physical block is 
55 accessed from the physical block number in that entry 
and the record in that block identified that matches the 
primary-key value in the entry. And since alternate keys 
may be non-unique, the next entry in the alternate-key 
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block is examined. If it lias an identical alternate-key 
value, the record corresponding to that entry is also 
retrieved, and this operation repeated until no entry has 
an identical key value. 

[0033] Next, we will describe generation. Generation s 
may be useful in a variety of situations, among them 
when multiple records already exist in this system's file 
and regeneration Is required for such reason as an 
increased number of overflow blocks, when restoring! 
infornrtation from a back-up medium to the medium onl io 
which this system has been implemented, and when) 
moving records stored by a method other than that 
described here to this system. Any of these may be 
effected by the same means. 

[0034] For regeneration, files are read in the order of is 
their primary keys with a sequential access method and 
a sequential file is created. A sequential file is similarly 
created for generation. 

[0035] Next, the location tables and alternate-key 
block are to be prepared. The number of location table 20 
entries is obtained by dividing the number of records 
planned to be stored by the number of records that can 
be stored in a single block, and that amount of space is 
secured in a contiguous region. Alternate-key blocks 
are secured for each type of alternate key All alternate- 25 
key blocks for a given type of alternate key have the 
same size, and the number of such blocks is determined 
thus: The number of entries (A) that can be stored in a 
single alternate-key block is obtained, and the number 
of records planned to be stored is divided by A to obtain 30 
the number of alternate-key blocks. Pre-alternate-key 
blocks are also to be prepared. 
[0036] Records are stored in a block in the order of 
their primary keys. The frequency of insertions may be 
estimated in advance or calculated from statistical data, 35 
and a certain proportion of empty space provided within 
a given block. This also allows for instantaneous stor- 
age of records. This is determined by how often inser- 
tions are to be performed. The proportion of empty 
space provided within a block may vary from block to 40 
block. When a record is stored, an entry is written to the 
location table. If a record has an alternate key, first an 
alternate-key entry is created from the record and 
stored in a sequential file. When alternate-key entries 
have been created for all records, the entries are sorted 45 
in the order of the alternate keys and the sorted entries 
stored in the alternate-key block. The number of records 
generated is divided by the maximum planned number 
of records, and storage is performed in the alternate- 
key block in the proportion corresponding to that quo- so 
tient. This is because the alternate keys are inserted 
rather than added. 

[0037] An alternate key may also be put to special 
use. Since keys are currently assigned to fields stored in 
a specific location in a record, they have an identical ss 
length and format, such as a product code or a cus- 
tomer code. Since keys and records may be noade rela- 
tional in this system, keys may be created for text data. 



Keys may also be created for records of non-specific 
format, as illustrated in FIG. 7. As in this example, alter- 
nate keys may be assigned to records not having fields 
of uniform location or fixed field lengths. 

Claims 

1 , An information storage system for computers, com- 
prising: 

a multitude of blocks of fixed length for storing 
a multitude of records, each having a unique 
key (a key that does not duplicate the key value 
of another record, hereafter called the "primary 
key"), having zero or one or more non-unique 
keys (keys that may duplicate the key values of 
different records, hereafter called "alternate 
keys"), and that are stored in said blocks in the 
order of their respective primary keys; 

structure such that said blocks consist of pri- 
mary blocks and overflow blocks, said records 
to be stored first of all in said primary blocks; 

a facility for allocating an overflow block in the 
event an inserted record can not be stored in a 
primary block and for allocating further over- 
flow blocks in the event an inserted record can 
not be stored in a single overflow block, the 
record then to be stored serially across said 
overflow blocks; 

a facility for allocating a new primary block in 
the event an added record can not be stored in 
the final primary block, the record then to be 
stored in said new primary block; 

a location table, used to manage the locations 
of said primary blocks such that blocks may be 
positioned with no restriction whatsoever on 
their physical locations; 

a facility to create blocks as they become 
required until physical data storage area is full 
such that each block need not be created in 
advance; 

a facility to partition files consisting of multiple 
record insertions after multiple specified pri- 
mary keys into multiple sub-ranges at the loca- 
tion of the file's insertion, this operation treated 
as the addition rather than the insertion of 
records, thus preventing the generation of over- 
flow records; and 

providing retention of said primary and over- 
flow blod^ partitioned across multiple comput- 
ers. 
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2. An information retries^al system, operating upon the 
information storage system of claim 1 and compris- 
ing: 

a facility for using location tables and alternate s 
tables, contiguous space for each of which is 
secured in advance in sizes corresponding to 
the number of records stored: 

a location table containing the number of said io 
location table (serial numbers beginning with 
the number zero that are identical to the pri- 
mary block numbers) and the physical address 
of the primary block as a single entry, having a 
structure such that they are stored in the order is 
of their location table numbers; 

a facility whereby retrieval by means of primary 
key is performed by searching the location 
table, the target record so retrieved by identify- 20 
ing the block including the target key value and 
then retrieving the record in that block; 

inclusion in individual location table entries of 
the primary key of the corresponding block in 25 
the event a random-access memory media 
device is used to store records, block searches 
to be performed by means of the location table 
alone; 

30 

a facility for creating alternate-key tables when 
alternate keys are used, said alternate-key 
tables to be stored in alternate-key blocks, 
each capable of storing multiple entries, and 
space for said alternate-key blocks secured in 35 
advance in identical sizes and in the quantity 
required; 

a structure whereby each said entry is made up 
of the number of the block in which an alternate 40 
key and the corresponding record are stored 
and the primary key of that corresponding 
record; 

a structure whereby said entries are stored in 45 
the ascending order of their alternate keys; 

a facility for treating multiple records existing 
within a single alternate key as a single entry; 

50 

a structure whereby entries having identical 
alternate keys are stored in the identical alter- 
nate-key block; 

a facility for storing entries in an alternate-key 55 
overflow block added to said alternate-key 
block in the event an entry or entries can not be 
stored in an alternate -key block because a 



large number of entries have an identical alter- 
nate key or due to the insertion of an alternate 
key; 

a facility for retrieval by means of alternate keys 
such that the entry containing a target alter- 
nate-key value is identified by searching said 
alternate-key blocK obtaining from that entry 
the block number in which the record is stored, 
and retrieving the corresponding record within 
that blocK and such that entries stored in both 
alternate-key blocks and atternate-key overflow 
blocks are retrieved in tiie event said alternate- 
key blocks alone are subjected to searching 
and an alternate-key overflow block exists for 
the alternate-key block searched; 

a facility for creating pre-alternate-key blocks of 
a structure identical to alternate-key blocks in a 
quantity obtained by dividing the space capa- 
ble of storing the entries of the number of alter- 
nate-key blocks by the size of the blocks; 

a facility such that when there are extremely 
few data records compared to the final number 
of storage records and consequently a high 
probability that addition of a further record may 
frequently result in the insertion of an alternate 
key, additional records are not stored first In an 
alternate-key block con'esponding to the final 
number of records, but stored in pre-alternate- 
key blocks until the number of records reaches 
the number of alternate-key blocks; 

a facility for moving entries from pre-alternate- 
key blocks to the alternate-key blocks at the 
point when the number of pre-alternate-key 
block entries becomes equal to the number of 
alternate-key blocks, and whereby a single 
entry is, in prindple, stored in each alternate- 
key block when said entries are moved, but 
entries having kientical alternate keys are 
stored in Identical alternate-key blocks; 

a facility for storing entries in an alternate-key 
overflow block added to an alternate-key block 
when said alternate-key block can not hold said 
entries because a large number of said entries 
have an identical alternate key; 

a facility for sequential reading by means of a 
primary key performed by detecting the pri- 
mary physical block address in the location 
table and sequentially reading the record in 
that primary block and, if an overflow block or 
blocks exist, the record in said overflow block or 
blocks, said subsequent block to be deter- 
mined by obtaining the next entry in the loca- 
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tion table and obtaining the physical block from 
said entry; and 

a facility for sequential access by means of an 
alternate key that first reads the first record as s 
described above and reads the next record by 
means of sequential access of the record of the 
next entry in the alternate-key block. 
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Figure 1 
Location Table 



Block No. 


Physical Address 




8 bytes -> 


8 bytes -* 










Final Pointer 
Blocic No. 


Physical Address 




Primary key 


«- 8 bytes -* 


8 bytes 


Any length 
(Maxiraum key value in final block) 



Sub-ranee Location Table 



Sub-Range No. 


Sub-range Block No. 


Physical Address 




<- n bytes 


(8 - n) bytes 


«- 8 bytes 














Sub-range Final 
Sub-range No. 


Poi nter 

Sub-range Block No. 


Primary Key 




<- n bytes 


*- {8 - n) bytes ^ 


<- Any length 

(Maximum key value in final block) 


n bytes -> 


<- (8 - n) bytes -+ 


Any length 
(Maximum key value in final block) 
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Figure 2 
Location Table 



Block 0 


Physical ^ 
address 


Block 1 


Physical 
address 


Block 2 


\ 


Block 3 


— ^ 







Blocks 
Block 0 



Block 1 



Block 2 



Block 3 
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Figure 3 

Alternate Key Entry 



Alt Key 


Block No. 


Primary Key 


<- Any length -> 


<- 8 bytes 


*— 


Any length 


— > 









Alternate Key Block 



Block 
No. 



=roB To 



Primary 



Overflow 



From To 



Al ternate key 
entry-1 



Al ternate key 
entry-2 



Overflow 
al ternate 
key 
block 
address 
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Figure 4 



Block 



Black 
No. 



Primary 



From 



To 



Overt low 



From 



To 



Record 1 



Record 2 



Prra 
key 



Alt 
key 



Prm 
key 



AM 
key 



Overflow 

block 

address 
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Figure 5 



Level 1 Keys in Pre-AI ternate Key Table 












4 8 




5 5 




5 9 
















> 










6 4 




6 5 




6 9 
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Figure 6 



Primary Block 0 




Prinary Blocic 1 


Primary Blocic 2 


Prinary Block 3 


Prinary Block 4 


Overflow Block 4-0 


Primary Block 5 




Primary Block 5 


Primary Block 7 


Primary Block 8 


Overflow Block 8 - D 


Primary Block 9 


Overflow Block 9-0 


Primary Block 10 




Primary Block 11 


Primary Block 12 


Primary Block 13 


Primary Block 14 
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Figure 7 

Text Data Keys 

0 0 0 1 i £0 Is school today . 

©@{D ® (D 

0 0 0 2 I wi 1 1 relax at home tomniorow . 

© ® ® ® ® @ 



The primary keys are 00Q1 and 0002. All keys are stored in block 155. 



Alternate key value 


Pointer (Primary key value, Block No. t Displacement of key) 


relax 


0 0 0 2. 1 5 5. 7 


home 


0 0 0 2. 1 5 5. 1 6 


school 


0 0 0 1 . 1 5 5. 8 


at 


0 0 0 2. 1 5 5, 1 3 


80 


0 0 0 1. 1 5 5. 2 


today 


0 0 0 1. 15 5. 1 5 


will 


0 0 0 2. 1 5 5. 2 


1 


0 0 0 1. 1 5 5. 0 


tmmorow 


0 0 0 2. 1 5 5. 2 1 


to 


0 0 0 1. 1 5 5. 5 


1 


0 0 0 2. 1 5 5. 0 
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